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We introduce a new method for detecting scaling in time series. The method 
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uses the properties of the probability flux for stochastic self-amne processes and is 
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called the probability flux analysis (PFA). The advantages of this method are: 1) 
it is independent of the finiteness of the moments of the self-amne process; 2) it 
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£>y does not require a binning procedure for numerical evaluation of the the probability 
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density function. These properties make the method particularly efficient for heavy 
tailed distributions in which the variance is not finite, for example, in Levy a— stable 
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^SJ ' processes. This utility is established using a comparison with the diffusion entropy 
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m ■ (DE) method. 
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I. INTRODUCTION 

Over the past ten years there has been an explosion of research papers published in the 
area of complex networks. Some would argue that this torrent of publications is a continu- 
ance of the growing awareness of complexity science whose origin can be traced back to the 
decade of the 1960s. The present emphasis on scale-free networks and their implications for 
scientific disciplines from sociology to neurophysiology had its beginnings with the fractal 
time series analysis of Mandelbrot [lj and the scaling parameter method of Hurst From 
this early appreciation for the limitations of 'normal' statistics to explain and/or character- 
ize complex phenomena we have such terms as 'self-similar', 'self-amne', 'scale invariant', 
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'fractal', and 'multi-fractal' to capture the non-homogeneous and non-isotropic behavior of 
statistical variability {3, 4|. As the mathematical developments became more familiar to a 
generation of scientists the investigations into the manifestations of these effects in biology, 
economics, geophysics, hydrology, neurophysiology, sociology and so on steadily increased 
in number. First there were a few isolated studies, followed by a steady rate of overlapping 
investigations, resulting in what is now a tsunami of monographs, technical papers and pop- 
ular articles. Consequently, even though no universally accepted definition of complexity 
has emerged, the consensus of scientific opinion has converged on the use of scaling as one 
signature of complexity. 

The scientific investigations into scaling have historically been of two kinds: 1) the formal 
mathematics identifying the properties data sequences must possess in order to scale in well- 
defined ways along with the techniques to analyze the data and reveal that scaling and 2) 
the application of those techniques to time series measured in complex phenomena. A 
stochastic process X(t) is said to scale if the time-dependence of the random variable is 
such that X (t)=K~ s X (Kt) with K > and S is a scaling parameter. Statistical methods 
to detect self-affinity (scaling) in time series identify the variable X(t) with the integrated 
time series and not with the time series itself £ (£), such that, 

t 

X(t) = ji{t')dt'. (1) 
'0 

Thus, the original time series £ (f) may be viewed as a sequence of increments of the variable 
X. In this sense many of the data analysis methods are "diffusive" in that the variable X 
is the aggregation of fluctuations denoted by the integral ([I]). Since its introduction in the 
middle 1990s detrended fluctuation analysis (DFA) [5( has become the method of choice for 



detecting scaling particularly in biomedical time series jfj]. The motivation for introducing 
DFA was the presumed applicability of the method to non-stationary time series, particularly 
those with long-time correlations. The scaling methods discussed by Mandelbrot [1] and 
DFA are "variance" methods in that they assume the time dependence of the variance of 
the stochastic variable Var(X(t)) with scaling parameter 5 is algebraic, namely: 

Var(X(t)) oc t 2S . (2) 
However there are scaling processes such as Levy flights [7] for which such relationships 
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are violated because the second moment diverges or Levy walks [7| for which the second 
moment is finite and does satisfy a scaling relation similar to ([2]). The necessity for de- 
tecting the proper scaling of Levy processes was one reason behind the development of the 
diffusion entropy analysis (DEA) method [8] described in Section II. The evident advantage 
of the information or diffusion entropy S(t) over second moment methods is that the former 
is always finite, independently of the behavior of the moments of the probability density 
distribution (pdf) p(x,t). The quantity p(x,t)dx is the probability of finding the trajectory 
X(t) in a infinitesimal neighborhood of x at time t. The divergence of the central moments, 
typical of Levy processes, create difficulties in the numerical determination of the pdf and 
the associated entropy. 

Herein we propose a new procedure for determining scaling, the probability flux analysis 
(PEA) as a general method for scaling detection. Since the PFA uses the cumulative proba- 
bility instead of the density p(x, t) itself as such it is statistically more efficient than DEA. 
The cumulative distribution integrates the pdf and reduces the noise due to the adoption 
of a statistical ensemble with a finite number of trajectories. The PFA method is shown to 
outperform the other methods in the case of Levy processes. The present paper is struc- 
tured as follows. In Section [Til we introduce scaling for the pdf , which is an extension of the 
concept of self-affinity. Ordinarily self-affinity is recovered in the algebraic time dependence 
of a scaling function (3(t) as we show using the DEA method and its numerical implementa- 
tion. We introduce the PFA method in Section IHIl and compare its performance with that 
of DEA on computer generated time series in Section IIVI and subsequently make the same 
comparisons with real world data. Finally, we draw some conclusions in Section [V] 



Consider a one- dimensional stochastic trajectory X(t) whose statistical properties are 
described by p(x,t). The stochastic process represented by X(t) is said to scale if the 
corresponding pdf satisfies the scaling relation 



II. SCALING 
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where the scaling function f3(t) is a function of time and without loss of generality we have 
assumed X(0) = 0. The function F in the above equation is the scaled density since 



J p(x,t)dx = J F{y)dy = l. (4) 

The relation (j3J) proposed here is a generalization of the widespread notion of scaling adopted 
in the literature [s[ Q], which often corresponds to the particular case 

P(t) = t 5 . (5) 



Herein we refer to the validity of the scaling relation together with (j3j) as the "algebraic" 
scaling condition. Geometrically, the scaling condition implies that p(x, t) is invariant under 
the transformations 

X ^^m X X(t) = -§§-X(Kt) (6) 

t^Kt s P( Kt ) 

where K>0 and the symbol = denotes equality in the sense that the pdfs for the variables 

s 

on either side of the equal sign are the same. In the case of algebraic scaling ((3(t)=t 5 ) we 
have 

x — y K~^x 

& X(t) = K~ 5 X(Kt). (7) 

t -)> Kt 



This last set of relationships defines a self-affine transformation 



a. 



A. Scaling detection with Diffusion Entropy Analysis (DEA) 

The pdf defined for the stochastic process X(t) can be used to calculate the information 
entropy. This use of entropy was implemented in discrete form for coding information 
by Shannon 9| and is referred to as the Shannon entropy. Cotemporaniously, this use of 
entropy was introduced in continuous for m by Wiener for studying the problem of filtering 
noise from messages in electrical circuits |l0|. In the analysis here we use the continuous 
form of information entropy 

S(t) = - J p{x, t) log 2 p(x, t)dx, (8) 



which was originally identified as diffusion entropy by Scafetta et al. [8j as a tool for detecting 
scaling in time series. The advantages of using entropy rather than the variance to detect 
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scaling is that entropy provides a more complete description of the stochastic process. The 
two approaches become equivalent only when the pdf is Gaussian. In the general case the 
distribution is not Gaussian and the central moments can and do diverge, as in the case of 
a— stable Levy distributions . If the scaling condition (J3]) is satisfied then it is straightforward 
to show that the entropy reduces to 

S(t) = S + \og 2 f3(t), (9) 

while in the case of algebraic scaling, given by Eq. (jHJ), we have 

S(t) = S + 5log 2 t, (10) 

where the additive constant is defined by the integral over the scaled variable 

So = -J F(y)log 2 F(y)dy. (11) 

The empirical determination of the histograms replacing the pdf's and the discretization 
of the integral (jSJ) is done as follows. A discrete realization Xi (I— 0,1,2, ...,N, and X —0) of 
the stochastic process X(t) is used to create the set of trajectories 

{X k (t)} = {X k+t -X k } k = 0,1,2,..., N-t. (12) 

We call the set {X k (t)} of iV — t + 1 trajectories the Single Trajectory Ensemble (STE) as 
distinct from the Multiple Trajectory Ensemble (MTE) generated using N — t + 1 different 
realization of the stochastic process X(t). The rationale for the STE is that in many real 
world applications one has only a single realization of the stochastic process available. The 
two ensembles are generally thought to produce identical results when the stochastic process 
is stationary and ergodic; even in the non-stationary case the STE and MTE averages are 
thought to be the same provided the effect of local trends can be eliminated, e.g., by using 



the DFA algorithm J5]. However, the equivalence between STE and MTE is lost cvou in the 



stationary and ergodic case 111 ], and caution must be used when interpreting the results 
of scaling analysis that rely on the STE. The procedure described in Eq. (112j) to generate 
the STE has been called the "overlapping" windows method as two trajectory X kl (t) and 
X k2 (t) may share a common profile if \k2— k\\<t. The overlapping windows method is 
often preferred to the non-overlapping windows methods because of the larger number of 



trajectories produced 
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In the present manuscript the STE is used to calculate the histogram for the pdf Pj,&{t) f° r 
finding a trajectory within an interval of size A centered on the value Xj. The bin size A has 
to be sufficiently small to consider the pdf constant within the interval [xj — A/2, Xj + A/2], 
and the integral of Eq. (jHJ) can then be approximated by the sum 

5 W - -^PiA® ^g 2 [p jA (t)} + log 2 A. (13) 

The accuracy of this numerical approximation decreases as t increases since we have only 
N — t+1 of trajectories in the STE. Values of the density p(x, £)~l/(iV— are impossible 
to reproduce correctly. Moreover, if the stochastic process X(t) is such that the probability 
of observing large values, positive and/or negative, increases in time than the pdf will 
assumes increasingly smaller values furhter compromising the validity of Eq. ( fTB"l) . This 
effect is particularly drammatic when the stochastic process has infinite second and and/or 
first moment such as the Levy flights and walks considered in Section IIVI 



III. SCALING DETECTION WITH PROBABILITY FLUX ANALYSIS (PFA) 

The rationale for the probability flux analysis (PFA) is to have a method of scaling 
detection that is independent of the binning procedure used to evaluate the histogram for 
the pdf statistically more accurate, and independent of the size of the moments of the 
distribution. Define a constant in the interval 6 e]0, 1[ and xg(t) to be a number such that 

N /*' 1)tfEP( *»^ (14) 

— oo 

where V(x,t) is the cumulative distribution. The value of the variate xg(t) encompasses a 
fraction 9 of the probability density p(x,t). We call PFA any algorithm that at any time 
step t calculates the number xg(t). 

If the scaling condition on the pdf (J3J) is satisfied, 

V(x,t)= [ p{x',t)dx'= [ -^—F ( \dx' = T (-^-] (15) 



P(t) \P(t)J \P(t) 

— oo — oo 

where the function J 7 is the cumulative distribution of the scaled density F. Thus, for a 
scaling pdf the condition (THj) becomes 

° = 7 \M) w ^ xe{t) = z<>m (i6) 
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where zg is a constant. This equation shows that in the case of scaling the motion in time of 
the location xg with a fraction 9 of the trajectories X(t) on its left side is directly proportional 
to the motion described by the scaling function (3(t). In case of algebraic scaling, Eq. 
and Eq. (G2), 

Xg{t) = zgt 5 => \n[xg{t)] = ln{z ) + S ln(t) (17) 

In the case of a constant drift, Eqs. (fT4j) — (fTTj) are valid in the moving reference frame 
centered on ut, or equivalently for the variable xg(t)—ut instead of xg(t). As an example, 
let us consider the stocastic process X(t) to be Browninan motion with no drift, then the 
pdf p(x, t) is a Gaussian function centered on the origin x—0, while the scaling function 
/3(t)=y/i. Consider 0=0.977 that 

is ^0.977 encompasses 97.7% of the distribution. For a 
Gaussian distribution £0.997 correspond to a value in excess of two standard deviation from 
the mean. Thus for a Brownian motion with no drift Xo,gg7(t)=2a\/i which satisfies the 
second relation of Eq. ffT6]) with zg=2a, a being the standard deviation of the increments of 
X(t). 

The numerical calculation of the function xg(t) can be done as follows. At any time t the 
trajectories {ATfc(t)} of the STE defined in Eq. (1121) are placed in ascending order and the 
value xg{t) is assigned to be Xxif) with K=[9 x (N — 1 + 1)], [..] indicating the integer part. 
This procedure requires no binning as the DEA algorhitm, however it can be computationally 
demanding since at any time step t the trajectories {X k (t)} must be sorted. Hereby, we use 
a faster procedure. We estimate xg{t) by calculating the cumulative distribution V(x,t) at 
fixed spatial intervals of length A: the accuracy (resolution) with which the value xg(t) is 
computed. Let {Xk(t)} be the set of trajectories of the STE at time t, and M{t) the total 
number of bins of length A necessary to cover the span of these trajectories, we define xg(t) 
as follows 

l M{t) 

Xg(t) = X\ such that /]pj,A(t) < 9 and PjA^) > ^ (18) 

j=l j=l+l 

where the symbol Pj^{t) indicates the trajectory frequency or histogram within the j-th 
interval [xj — A/2, Xj + A/2]. Calculating the frequencies Pj^it) requires a single sequential 
scanning of the trajectories {Xk(t)} while any sorting algorithm requires more computational 
power. The procedure described by Eq. (1181) is similar to the DEA algorithm (\VS\i as both 
methods require the calculation of Pj,/\{t). However, there is a fundamental difference. For 
the DEA to be meaningful p J5 a(^)/A must accurately reproduce the pdf in each interval. 
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This strong requirement is not necessary for the method described in ({TBI) . The adoption of a 
binning procedure in the case of PFA is just a computational device to speed up calculation. 

IV. COMPARISON BETWEEN PFA AND DE METHOD 

In this section, we compare the results of the PFA and DEA methods applied to a number 
of computer generated sequences having known statistical properties. In addition we apply 



the two techniques to the electroencephalogram (EEG) data set previously analyzed 14]. 



A. Levy Flights 

In this section we generate a number of Levy flights for the stochastic process X(t). These 
processes have stationary delta-correlated increments £ (t) with infinite variance and possibly 
also infinite mean depending on the value of the Levy index selected. The generalized central 
limits theorem predicts the pdf to be, after an initial transient that depends on the specific 
distribution of the increments, a Levy distribution. The function F(y) on the right hand 
side of Eq. fl3]) is a stable Levy distribution whose Fourier transform is given by 

F{k) = exp [ ik^-\ck\ a (l-irj sgn(/c)$) ] . (19) 

In the above equation T{k) is the characteristic function of F(y), 7 G 1Z is the shift param- 
eter, i] g[-1,1] is called the skewness parameter, a measure of asymmetry, and < a < 2 is 
the Levy parameter. Finally, the constant $ is equal to tan(7ra/2) for all values of a except 
for a=l when 3>=(2/7r) log \k\. 

We choose the increments £ of the variable X to be distributed according to an inverse 
power law 

where /iG] 1, +00 [ is the "index" of the inverse power law and B G 1Z is a location parameter 
(£ ^> B oc |£| >1 ). The scaling parameter 5 of the resulting Levy distribution for 

p(x,t), the power law index /1, and the Levy parameter a are connected as follow: 

a = fi — 1 and 5 = — = for//G]l,3[. (21) 

a /J, — 1 

where, of course, a=2 corresponds to the Gaussian distribution, which we do not consider 
here. Random numbers distributed according to fTSOj) can be obtained from random numbers 
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uniformly distributed in the interval ]0,1[ 15] . To test the performance of PFA and DEA on 
Levy flights, we generate 10 7 random numbers distributed according to the inverse power 
law Eq. ( )20l) with location parameter B = 1 and power law index /i = 1.6. The result are 
recorded in figures below. 

We see from the calculation depicted in Figure 1 that both the DEA and PFA methods 
apparently provide reliable estimates of the early time scaling of the stochastic process, even 
though we have only used the first 25% of the data in the use of PFA. The slope of the 
curves in both calculation is 1.6, the expected scaling index 5 in (1211) . However after two 
decades the DEA curve begins to run out of statistics, whereas the PFA persists for another 
decade and one-half before it markedly deviates from the theoretical curve. The detailed 
divergence between the two calculations is evident in the insert. 
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FIG. 1: Figure 1: Probability Flux Analysis (PFA) and the Diffusion Entropy Analysis 
(DEA) are graphed for 10 7 computer-generated data points using (1201 with a Levy flight of 

scaling index 5 = 1.6(/i = 1.6). 



In Figure 2a the pdf is depicted at times t = 50 and t = 10 3 , with the latter vertically dis- 
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placed for visual clarity. In the vicinity of the first time both the DEA and PFA calculations 
have the same slope as that of theory, whereas in the vicinity of the latter time the DEA 
calculation significantly deviates from the theoretical curve and the PFA calculation does 
not. The pdf at t = 50 loses its crispness for large values of the variate where the process 
becomes undersampled and therefore fluctuates significantly from value to value. On the 
other hand, the pdf at t = 10 3 is undersampled throughout the domain of the distribution 
and therefore the weight of successive values of the histogram are quite noisy. This is what 
is meant by "running out of statistics" in the DEA calculation in Figure 1; there is insuffi- 
cient statistics to reduce the noise in the histogram. The DEA calculation therefore is more 
useful at early times where the histogram for the pdf is more reliable. At latter times the 
larger values of the variate become increasingly more important and the histograms becomes 
increasingly less reliable. However, this lack of reliability in the histograms has no apparent 
influence on the PFA since slope determination with this latter method is done only using 
those values of the variate below the percentage cut off 6 and not on a faithful reproduction 
of the pdf in the entire range of the variate. 

Another way of comparing the distribution at different times is to examine the survival 
probabilities. Levy distributions (with the exception of the Gaussian case a — 2) have 
inverse power-law tails with index a + 1. Thus, the corresponding survival probability has 
an inverse power-law tail with index a. In Figure 2b, we see that at t = 50 the slope of 
the inverse power-law survival probability (a = /i — 1 = 0.6 in our case) coincides with the 
theoretical curve over multiple decades of variate values. The extended inverse power-law 
region of the survival probability is indicative of the quality of the pdf depicted in Figure 2a. 
On the other hand, the t = 10 3 survival probability does not have a region that coincides 
with the theoretical inverse power law for any values of the variate. This lack of scaling 
is consistent with what is observed from the DEA calculation in Figure 1. The empirical 
survival probability is not sufficiently robust to detect the scaling in the data at late times. 
However the survival probability is statistically robust at the 1 — 9 level and thus the PFA 
method continues to detect the scaling. 

To drive home this point it is useful to examine how the PFA depends on the choice 
of 9. In Figure 3 we plot the residue of the PFA analysis for four widely spaced values of 
the fraction 9 using the same computer-generated data of Figure 1 and 2 ( a Levy flight 
index a— 1.6 ; ji—1.6). The calculation tracking the theoretical slope for the longest time 
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FIG. 2: Figure 2: (a) The pdf is given at times t = 50 and 10 3 .It is evident that p{x,t) is 
better at earlier times indicating that this is where the DEA will be most useful, (b) The 
survival probability at the two times is depicted. At t = 50 the survival probability is an 
inverse power law for more than two decades, whereas at t = 10 3 there is essentially no 
region where a theoretical inverse power law is detected. 
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t = 10 4 has 9 = 0.05. As the "time" t increases the number of trajectories N — t + 1 
available in the STE decreases. Therefore the larger 9 the earlier in time the cumulative 
(survival) probability {1 — 9) becomes statistically unreliable. The divergence of the first and 
second moments in the Levy flight considered here makes this loss of statistical robustness 
even more dramatic. A similar effect will also occur if very small values of 9, for example, 
9 « 0.01, are adopted. In fact, although the support of the pdf is [0, oo], the left border 
of the support of its numerical approximation can be several orders of magnitude larger as 
shown in Figure 2 
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FIG. 3: Figure 3: The PFA calculation is graphed for 10 7 computer-generated data points 
using ( 120]) with a Levy flight of index a = 1.6(/i = 1.6) for a number of different values of 9. 

B. Levy walks 

A random walk is a stochastic process X(t) where the distance covered by the walker, the 
value of the variate, in a finite time is limited. The restriction on the walk is a consequence 



13 



of the physical relation between taking a step of a given size and the time required to take 

such a step [7] . Hence, at any given time t the walk pdf is bounded and all the moments are 

finite. A Levy walk has a pdf between the fastest walkers that is approximately equal, after 

a transient, to a Levy distribution. Levy walks can be generated in a number of different 

I T 

ways, for example, using chaotic intermittent maps [16| . or, as we will do herein, using 
inverse power-law distributed random numbers. We consider a type of Levy walk called 
the Symmetric Velocity Model (SVM) 161 ] . In the SVM walk, the velocity of the random 
walker can only assume two values/states, here taken to be ±1. The velocity of the walker 
remains in a given state for an interval of time of random duration r distributed according 
to the waiting-time distribution density ip(r) given by the inverse power-law distribution 
( 1201) with index fi £ [2,3]. After waiting in a given state for a time r, that is, traveling 
at a constant velocity for the specified time interval, a coin is tossed to determine the new 
value of the velocity; and a new r is extracted from the distribution ip{ T ) to determine the 
duration of this new velocity value. The pdf of the SVM walk at time t is bounded between 
[—£,£] and can be approximated (t » mean time of V^)) by the scaling expression Eq.(|3]) 
with F(y) being a symmetric Levy stable distribution with shift parameter given by 7 = 0, 
the skewness par ameter rj = 0, Levy index a = fi — 1, and by using the scaling function 



P(t) = t l / a Urn- 

For the numerical implementation of the SVM walk, we extract 10 6 random waiting times 
r according to the inverse power-law distribution with index \x = 2.5 and location parameter 
B = 1. Then we consider the transformation r — > [t] + 1 where [.] is the integer part of the 
term in brackets. This transformation creates a sequence of integers {r^ > 0}, which are 
inverse power-law distributed with the same index /z and approximately the same location 
parameter B of the original sequence. To assign the velocity states we generate a 10 6 long 
sequence of random coin tosses: {vj = ±1}. Finally we use the couples {tj, Vj} to create 
the sequence £j of increments (the velocity of the walker) of the stochastic variable X and 
consequently the sequence X,, which is processed using the DEA algorithm. 

In Figure 4 the PFA calculations for the four different values of 9 are compared with 
the DEA calculation. After an initial transient (t ~ 10 2 ), the DEA calculation tracks the 
theoretical straight line (S(t) oc 5lnt with 5 = — 1) = 0.6) up to a time t < 10 4 after 
which the DEA begins to run out of statistics. The pdf of the SVM walk is symmetric, 
however the numerical evaluation may not be symmetric, and at any time step we subtract 
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the numerical mean before calculating the value of xg (t) , which satisfies Eq. (114"]) . Moreover 
due to the symmetry of the SVM walk xg =05 (t) = 0, and xg 1 {t) = —xg 2 {t) if \9 1 — 0.5| = 
\02 — 0.5| , that is, if the two different values of the parameter 9 are symmetric with respect 
to 0.5. Therefore, we limit ourselves to doing the PFA calculations for 9 > 0.5. The results 
depicted in Figure 4 indicate that PFA tracks the therortical straight line for an additional 
decade beyond that of DEA. The results are approximately independent of the particular 
value of the cut off fraction 9, although larger values show less wiggly behavior than do 
smaller ones; an effect seemingly at odds with those found in the previous case of the Levy 
flight. The rationale for this effect is the following. For the SVM Levy walk, £0=0.5 (£) — 
for all t, however for the numerical calculation this is not true and a plot of £0=0.5 (t) reveals 
a fluctuating value. The fluctuations increase in intensity as the time t increases since the 
number of trajectories in the STE decreases as N—t+1, and the pdf becomes less symmetric. 
The cut off location xg(t) for values of 9 closer to 0.5, such as 0.6 are more affected by this 
type of noise than the cut off location for larger values of 9 such as 0.9. The overall effect 
is that although in theory xg=os{t) should be more robust than xg=o,g{t), in practice, the 
latter is "crisper" than the former. 



C. EEG records 



In the previous two subsections we compared the results of using the DEA and PFA 
methods to determine the known scaling properties of computer-generated data sets having 
diverging and finite second moments, respectively. Now we turn our attention to experi- 
mental data sets whose unknown scaling properties we wish to determine. One such data 
set of both historic and contemporary interest is that of the electroencephalogram (EEG) 
depicting the erratic dynamics of the human brain. The observed scaling in EEG time series 
is not as straightforward as observed in other less complex phenomena. Various measures 
other than the standard deviation and spectrum have been introduced into the study of EEG 
time series, each one stressing a different physiologic property thought to be important in 
representing the brain's dynamics. Most recently the DEA method has revealed a rather 
remarkable behavior of the single channel EEG time series. Specifically the failure of the 
EEG signal to scale: e.g.: Hwa et al. 18] applied DEA to the series of increments of EEG 
increments and found a bi-scaling regime. Ignaccolo et al. 14j argues that the EEG signal 
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FIG. 4: Figure 4: The DEA and PFA calculations for a Levy walk for four values of the 
fraction of the data included in the calculation. 



during resting activity can be modeled using a dissipative linear dynamic process X(t), i.e., 
an Ornstein-Uhlenbeck process, with a quasi-periodic driver having a random amplitude and 
frequency and an additive random force r] (t) which is a delta correlated Gaussian process 
of strength a. Latka et al. shows how the model proposed in [14] explains the bi-scaling 
regime observed by Hwa et al [3] and why this is not a "real" algebraic scaling regime 
(satisfying Eqs. ([3]) and ([5])) but is an artefact of DFA. 

Fig. [5] shows the DEA for an EEG channel under the closed eyes resting condition. The 
time t is expressed in seconds with 1 second corresponding to 250 data samples. The observed 
saturation is the results of dissipative linear dynamics while the decaying oscillations are 
produced by the random periodic forcing: the alpha rhythm which is the well known wave 
pattern present in EEG under the closed eyes resting condition. Different trajectories of 
the STE (fl2|) experience wave packets of different amplitude and frequency. The typical 
duration of a wave packet is ~0.5s [14J. The mixture of wave packets results in a pattern of 
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destructive interference. The larger the time t, the wider the spectrum of different amplitudes 
and frequencies present in the trajectory X^(t) of Eq. ffT2l) . and the more intense is the 
interference. This mechanism explains the observed decaying oscillation for the information 
entropy S(t). The apparent period ~0.013s is just the amplitude weighted average of a— wave 
packet periods occurring in the particular EEG record examined. Also plotted in Figj5] is 
the PFA for different values of the cut off fraction 9. We see how PFA reproduces all the 
characteristic observed for DEA. Note that EEG records are almost symmetric so that the 
PFA analysis for #<0.5 is just the mirror image of the one shown, as is the Levy SVM walk 
of the previous section, of the one relative to 6>0.5. This also explain why the results are 
increasingly "crisp" when moving from 0=0.6 to 6=0.9 in Fig. 5. 
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FIG. 5: The DEA processing of the EEG data from a single channel in the occipital lobe is 
labeled S(t). The equivalent PFA processing of the same data using four different values 9 

is also shown. 
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V. CONCLUSION 

The PEA calculates the "cut off" location xg which encompasses a fraction 9 of the 
pdf of the stochastic process X(t). As the pdf evolves in time so does xg in order to 
always encompass the same fraction of the distribution. In this sense, PEA is a "volume" 
preserving transformation. The volume preserving transformation for a scaling distributions 
is, aside from a constant multiplicative factor, the scaling function itself (Tl6|) . Therefore PFA 
can be a scaling detection tool. The power of this method is that the volume preserving 
transformation is statistically more robust than any method based on a detailed knowledge 
of the pdf such as DEA. The PFA with cut off fraction 9 is as statistically robust as the 
cumulative distribution V(x,t) at level 9 or the survial distribution (1 — V(x,t)) at level 
(1 — 9). For a bounded signal with a "well-behaved distribution" (no inverse power law 
with infinite first or second moment) such as the EEG data the statistical advantage of 
the PFA over DEA may not be so evident (Fig. [5]). However, as soon as we depart from 
the realm of well-behaved distributions the statistical advantage of PFA becomes apparent. 
For the SVM Levy walk the agreement with the theoretical curve is extended by one decade 
(Fig. Hj). SVM Levy walks have bounded pdf's but a non well-behaved distribution is hidden 
in this stochastic process: the pdf of the waiting times ( 120|) with index /xe]2,3[, which has an 
infinite variance, and is the reason why a Levy distribution appears in the region between 
the bounding sites. The statistical advantage of PFA over DEA is even more evident in 
the case of Levy flights where the pdf is unbounded with infinite first and second moments. 
In this latter case, almost two decades are gained using one method rather than the other 
(Fig. [Hand E]). 

Aside from scaling detection, the results of Fig. [5] relative to the EEG record, where 
there is no algebraic scaling, shows that PFA can be used to investigate the dynamics of 
a stochastic process X(t). In fact, the details of the dynamics generating the signal X(t) 
are somewhat reflected in the volume preserving transformation performed by PFA. From 
these results the inescapable conclusion is that the PFA method is superior to the DEA and 
ought to replace it. Moreover, since we have shown elsewhere [l^] that DEA is preferable 
to detrended fluctuation analysis (DEA), we must further conclude that PFA replace DFA 
as well. 
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